Prosodic models and speech synthesis: towards the common ground
نویسندگان
چکیده
Prosodic models have been extensively applied in speech synthesis. However, the necessity of synthesizing prosody has as yet not resulted in a generally agreed upon approach to prosodic modeling. This statement holds for the assignment of segmental durations as well as for generating F0 curves, the acoustic correlate of intonation contours. This paper concentrates on the use and usability of intonation models in speech synthesis. Intonation synthesis can be viewed as a two-stage process, and intonation models differ in terms of the interface they provide between the higher linguistic components and the acoustic prosodic modules. We will review the common ground between intonation models and the constraints imposed by different speech synthesis strategies.
منابع مشابه
Prosodic models, automatic speech understanding, and speech synthesis: towards the common ground
Automatic speech understanding and speech synthesis, two of the major speech processing applications, impose strikingly different constraints and requirements on prosodic models. The prevalent models of prosody and intonation fail to offer a unified solution to these conflicting constraints. As a consequence, prosodic models have been applied only occasionally in end-toend automatic speech unde...
متن کاملProsodic models and speech recognition : towards the common ground . 1
In spite of the claim made by many researchers that prosody is a valuable source of knowledge in speech recognition in particular and in automatic speech understanding (ASU) in general, it has not been used up to now to a considerable extent. Partly, this might be due to the fact that its role is more important in more elaborated speech whereas until recently, the main emphasis was on dictation...
متن کاملطراحی و ارزیابی یک مدل بازسازی گفتار به روش همگذاری واحدهای حساس به بافت نوایی
This paper describes the design and evaluation of prosodically-sensitive concatenative units for a Persian text-to-speech (TTS) synthesis system. Thesyllables used are prosodically conditioned in the sense that a single conventional syllable is stored as different versions taken directly from the different prosodic domains of the prosodically labeled, read sentences. The three levels of the Per...
متن کاملTree grammars as models of prosodic structure
The common ToBI system of transcription assumes a sequential model of prosody. Many linguists argue for a tree structure explaining the synchronization and interaction among prosodic units. Could tree grammars, used previously in syntax-based language modeling, be used to model prosodic trees? We present a method of converting sequential transcripts into trees, and then demonstrate that modelin...
متن کامل